Super-Scalar Processor Design
نویسنده
چکیده
A super-scalar processor is one that is capable of sustaining an instruction-execution rate of more than one instruction per clock cycle. Maintaining this execution rate is primarily a problem of scheduling processor resources (such as functional units) for high utilrzation. A number of scheduling algorithms have been published, with wide-ranging claims of performance over the single-instruction issue of a scalar processor. However, a number of these claims are based on idealizations or on special-purpose applications. This study uses trace-driven simulation to evaluate many different super-scalar hardware organizations. Super-scalar performance is limited caused by both branch delays and instruction misap rimarily b instruction-fetch inefficiencies rgnment. 8ecause of this instruction-fetch lirnitation, it is not worthwhile to explore highly-concurrent execution hardware, Rather, it is more appro riate to explore economical execution hardware that more closely matches the instructron tLoughput provided b reducing the instruction-fetch inefii the instruction fetcher. This stud ciencies and explores the resulting iI examines techniques for ardware organizatrons. This study concludes that a super-scalar processor can have nearly twice the scalar processor, but that this re uires 1 that four major hardware features: p” xformance of a out-o -order execution, register renarmng, branch pre iction, and a four-instruction decoder. These features are interdependent, and removing any single feature reduces average performance by 18% or more. However, there are many hardware simplifications that cause only a small performance reduction.
منابع مشابه
Super Scalar Processor using Chip Level Optical Interconnections
In this paper we present the design of a super-scalar processor constructed using optoelectronic components interconnected via high-speed free-space optical buses.
متن کاملPaper Special Issue on Novel Vlsi Processor Architecture Processor Pipeline Design for Fast Network Message Han- Dling in Rwc-1 Multiprocessor
In this paper we describe the pipeline design and enhanced hardware for fast message handling in a RICA-1 processor, a processing element (PE) in the RWC-1 multiprocessor. The RWC-1 is based on the reduced inter-processor communication architecture (RICA), in which communications are combined with computation in the processor pipeline. The pipeline is enhanced with hardware mechanisms to suppor...
متن کاملPerformance Test Case Generation for Microprocessors
We describe a systematic methodology for generating performance test cases for current generation microprocessors. Such test cases are used for: (a) validating the expected pipeline flow behavior and timing; and, (b) detecting and diagnosing performance bugs in the design. We cite examples of application to a real, superscalar processor in pre– and post–silicon stages of development. Suggested ...
متن کاملA Simulator for the Hydra CMP
In recent years, Single Chip Multiprocessors have been gaining ground as alternatives to superscalar processor architectures. Comparisons [3] between CMPs and Superscalar Processors argue the case for chip multiprocessors through the results that super scalar architectures outperform CMPs of comparable die size and cost by only a small margin where coarse grained parallelism is not available. O...
متن کاملGenerating evidence for certification of modern processors for use in safety-critical systems
This paper investigates the implications of using a modern super-scalar processor in the safety-critical domain. Firstly, a description of current certification practice and devices is given as background. This is followed by an assessment of how the certification argument and its supporting evidence are affected by the use of a super-scalar processor. Two types of modern processor are consider...
متن کامل